3574 results found.
Written
Treebank,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:On the Role of Style in Parsing Speech with Neural Models
-
Paper track:12.10 Metadata for ling./discourse structure (disf/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Trang Tran | Treebank-3 | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC User Agreement for Non-Members
Size:
59 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Linguistically-informed Training of Acoustic Word Embeddings for Low-resource Languages
-
Paper track:12.4 Spoken term detection/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Zixiaofan Yang | The Switchboard-1 Telephone Speech Corpus | /N |
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC97S62/
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
None Production Status:
Existing-used
Use:
Speech Disorders
-
Paper title:Say what? A dataset for exploring the error patterns that two ASR engines make
-
Paper track:13.6 Voice quality characterization for clinical v/Oral Presentation
-
Paper status:Accept Special Session
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Meredith Moore | TORGO | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
60 hours Production Status:
Existing-used
Use:
Voice Control
-
Paper title:Say what? A dataset for exploring the error patterns that two ASR engines make
-
Paper track:13.6 Voice quality characterization for clinical v/Oral Presentation
-
Paper status:Accept Special Session
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Meredith Moore | UASPEECH | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
22 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Say what? A dataset for exploring the error patterns that two ASR engines make
-
Paper track:13.6 Voice quality characterization for clinical v/Oral Presentation
-
Paper status:Accept Special Session
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Meredith Moore | Common Voice | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
Size:
None Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Say what? A dataset for exploring the error patterns that two ASR engines make
-
Paper track:13.6 Voice quality characterization for clinical v/Oral Presentation
-
Paper status:Accept Special Session
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Meredith Moore | TIMIT Acoustic-Phonetic Continuous Speech Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
MIT
Size:
10 GByte Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:A scalable noisy speech dataset and online subjective test framework
-
Paper track:6.3 Noise reduction for speech signals/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ross Cutler | Microsoft Scalable Noise Speech Dataset | /N |
Documentation:
English
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons Attribution-ShareAlike
Size:
40000 sentences Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Language learning using Speech to Image retrieval
-
Paper track:10.1 Multimodal systems/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Danny Merkx | Flickr Audio Caption Corpus | /N |
Documentation:
D. Harwath and J. Glass, "Deep Multimodal Semantic Embeddings for Speech and Images," 2015 IEEE Automatic Speech Recognition and Understanding Workshop, pp. 237-244, Scottsdale, Arizona, USA, December 2015
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
8000 sentences Production Status:
Newly created-in progress
Use:
Dialogue
-
Paper title:Analyzing Verbal and Nonverbal Features for Predicting Group Performance
-
Paper track:11.5 Analysis of verbal, co-verbal and nonverbal b/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gabriel Murray | The Group Affect and Performance (GAP) Corpus | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Portuguese Romanian Russian Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
500 hours Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Adapting Transformer to End-to-End Spoken Language Translation
-
Paper track:12.1 Spoken machine translation/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mattia A. Di Gangi | MuST-C | /N |
Documentation:
None




